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ABSTRACT 

\^ ' We study the effects of large-scale density fluctuations on strong gravitational lensing. 

Previous studies have focused mostly on weak lensing, since large-scale structure alone cannot 
produce multiple images. When a galaxy or cluster acts as a primary lens, however, we find 
?-H , that large-scale structure can produce asymmetric shear of the same order as the lens itself. 

, Indeed, this may explain the origin of the large shear found in lens models in conflict with the 

small ellipticity of the observed galaxy light distributions. We show that large-scale structure 
changes the lens equation to the form of a generalized quadrupole lens, which affects lens 

■ reconstruction. Large-scale structure also changes the angular diameter distance at a given 
\^ • redshift. The precise value depends on the lens and source redshifts and on the large-scale 

■ structure power spectrum, but the induced la uncertainty in determinations of the Hubble 
constant from measurements of time delays is of order 5 — 10%. If observations of lensing can 
constrain the magnitude of the shear which is due to large-scale structure, it would provide a 

[ direct probe of the overall amplitude of mass fluctuations. 
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Subject headings: gravitational lensing — large-scale structure of universe 



1. Introduction 



X 

I Gravitational lensing is one of the most promising methods of mapping the distribution of matter at 

cosmological distances. Detailed observations of multiple images of quasars have been used to try and 
reconstruct the lensing mass distribution (e.g. Falco et al. 199l] ). It has also long been recognized that 



measurements of the time delay between images can be used to determine the Hubble constant ( Refsda] 



1964 1966). However, practical application to the double quasar 0957+561 has been difficult because of 



uncertainties in lens modelling as well as confiicting measurements of the time delay (e.g. Vanderriest et 



il. 19891 ; ILehar et al. 19921) . 

Since gravitational lenses and sources typically lie at significant redshifts, light rays are defiected 
by large-scale structure (LSS) as they traverse the enormous distance from source to observer. These 
defiections are not large enough to produce multiple images, but they do distort the shapes of sources. 
Such weak lensing has been investigated both analytically (e.g. Miralda-Escude 1991| ; Kaiser 1992| ) and 
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in N-body simulations (Jaroszynski et al. 1991; Blandford et al. 1991). These studies find a shear of 
order 1%, coherent over a scale of ~ 1°, in a flat CDM model. This shear may in principle be detected 
observationally as a coherent distortion of background galaxies, when averaged over a sufficiently large 



angular field in order to be separated from the random scatter of intrinsic ellipticities (e.g. Mould et al 
19^; IVillumsen 1995)) . 

The shear due to LSS can also affect strong lensing, when it acts in addition to a strong primary lens, 
a galaxy or cluster near the line of sight to the source. This effect is enhanced compared to weak lensing, 
because of the small angular separations between multiple images. Also, the higher redshift of quasars 
compared to faint galaxies increases the cumulative shear from the observer to the source. Seljak (1994) 
estimated the dependence of the r.m.s. value of this shear on the power spectrum of density fluctuations, 
and found it to be of order 10% for a source at redshift 3. Seljak also considered the effect of LSS on the 
time delay, and showed that the lowest order terms cancel out in the total time delay. However, since 
these canceling terms are separately much larger than the time delay from the primary lens, even higher 
order terms might still dominate the time delay and threaten the effort to determine the Hubble constant 
from lensing. 

In order to find precisely how LSS affects the observables of a lens system, Surpi et al. (1995) set up the 
lens equation in the presence of a lens plus LSS. They made an expansion for the position of a light ray 
in powers of its deflection from the unperturbed straight path, and kept only the lowest order term. This 
term is equivalent to a constant angular deflection at the lens. They thus concluded that LSS leaves all 
observables (such as relative image positions) unchanged to lowest order. Indeed, since the actual source 
position is unobservable, the effect of this lowest order term can be removed from the lens equation by 
subtracting the constant angle out of the source angle. This approximation of keeping the lowest order 
term is not a good one, however, since the shear due to LSS arises from relative deflections between 
different light rays, which involve higher order terms in the expansion. We follow a similar approach but 
include these higher order terms in order to study the observable effects of LSS. 

In this paper we analyze the effect of LSS on the lens equation and time delay. Readers primarily 
interested in the results may wish to concentrate on §4 - §6 and §8. We derive the lens equation in §2 
and §3, and find it to have a form similar to the generalized quadrupole lens of Kovner (1987) (§4). We 
express the perturbed lens equation in terms of integrals along the line of sight of the scalar, Newtonian 
potential. These integrals are random variables of zero mean, whose variances and covariances can be 
evaluated in terms of the power spectrum of density perturbations (§5). We find that the effective shear 
in the lens equation is not simply the integrated shear from the observer to the source, but is reduced 
by 40% or more, depending on the lens redshift. For realistic power spectra that include modelling of 
non- linear effects, the effective shear is of order 6% for a source at redshift 3. In addition, the accumulated 
shear from the observer to the lens can significantly affect the observables as well as the appearance of 
the lens itself, if the lens is at a relatively high redshift. In §3 we also discuss how our results determine 
the effect of LSS on angular diameter distances. 

The most important effect of shear is in producing four- image systems. Many confirmed lens sys- 
tems are quads, since they are easy to identify and tend to be highly magnified (Kochanek 1991b, 1995; 
Wallington & Narayan 1993). These systems are inconsistent with an axi-symmetric lens, for which all 
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the images would have to be cohnear. Lens models of quads typically find a shear of order 7 — 11% (e.g. 
Kochanek 1991a| ). If due to the lensing galaxy itself, this would imply a projected ellipticity (= 1 minus 
the ratio of minor to major axis) for the mass of ~ 35 — 50%. By contrast, the typical value observed 
for ellipticals is « 20% (e.g. Ryden 1992; Schechter 1987). Since the cross-section for producing quads 
increases with shear, observed quads should be biased towards high shear, whatever its origin (Kassiola 
Sz ^ovner 1993| ). In particular this includes a bias toward an alignment between the shear caused by 
the galaxy and the external shear. High resolution observations of lensing galaxies can determine the 
degree of agreement or inconsistency between the observed ellipticity and the inferred shear in specific 
cases. Recent observations of a four-image "Einstein cross" with the Hubble Space Telescope WFPC2 
( [Ratnatunga et al. 1995 ) found an ellipticity in the potential of 26%, which implies a mass ellipticity of 
60%. The light distribution was found to have an ellipticity of only 32%. One possible explanation is 
that the dark matter halo is highly flattened compared to the light distribution, but other observations 
may not support the existence of such large differences in typical galaxies (for a recent review see Sackett 
19^1^). Another possibility is that a LSS shear of order 8% has been added on to the 7% shear of the lens. 
In fact, the directions of the total shear and that due to the light distributions are different by about 
13°, so the disagreement is larger. A recent HST observation of a lensed arc ( Eisenhardt et al. 1995 ) 
has similarly found an observed ellipticity of about half that implied by the best fit lens model. Note, 
however that other possible sources of external shear, namely additional galaxies or clusters near the line 
of sight to the source, must be properly accounted for before the contribution of LSS can be determined. 

In §4 we also consider the effect of LSS on relative time delays of images. The related phenomenon 
of amplification of sources due to large-scale structure has been studied by Babul & Lee (1991), but 
not in the presence of a primary lens. We show that the effect on time delays is enhanced through a 
combination of two separate effects. LSS thus limits our ability to derive accurate values of the Hubble 
constant from lensing. The induced uncertainty depends on the lens and source redshifts and on the 
large-scale structure power spectrum, but in §5 we find it to be of order 5 — 10% at la. This uncertainty 
may have either sign since LSS may effectively produce a negative mass density (negative is measured 
w.r.t. the mean density of the universe, not w.r.t. zero). 

In §6 we choose a simple lens distribution, a singular isothermal sphere, and illustrate the effect of 
LSS on relative image positions and time delays, as well as the caustics and critical curves of the lens 
system. In §7 we apply our formalism to the transition from strong to weak lensing and demonstrate its 
agreement with previous studies of weak lensing (a detailed derivation is given in Appendix B). Finally, 
in §8 we give our conclusions and comment on possible applications of our results. 

We assume a flat universe throughout, in the absence of an accurate fitting formula for the time depen- 
dent, non-linear power spectrum in a curved background. Our formalism is, however, easily generalized 
to a closed or open universe, as we show in Appendix A. 
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2. Formalism 

We work in the framework of a flat Robertson- Walker metric with small-amplitude scalar metric 
fluctuations. In the longitudinal gauge ( Bardeen 19801 ) we can write the line element as 



ds'^ = a^{T)[-{l + 2(j))dT^ + {l-2(j))dx-dx] , (1) 

where we set c = 1. Here r is the conformal time, a(r) the expansion factor, and we are using comoving 
coordinates x. Redshift in a flat, matter-dominated {0,^ = 1) universe is given hy 1/(1 + z) = a(r) = 
, with Hq = 100/ikmsec~^ Mpc~^ the present Hubble constant. Also 4> is the scalar, Newtonian 
potential obeying the cosmological Poisson equation 

V2(/> = AirGa^p 5 , (2) 

where p is the mean density of the universe and 6 = (p—p) /p is the local density perturbation. We describe 
statistical properties of ip in terms of its Fourier transform cl){k,T), where cI){x,t) = J d^k(j){k,T)e^^'^ . Its 
ensemble mean and variance are {(l){k,T)) = and {(l){k,T)(p* (k' ,t)) = P^{k,T)6^{k — k'), where P^{k^T) 
is the power spectrum of the potential at time r, simply related to the density power spectrum by 
P^(fc,r) = {^7:Ga\T)-p{T)f k-^Pp{k,T) . 

We place the observer at the origin of coordinates and the primary lens Q on the z-axis. We use r to 
denote values of the z-coordinate, with zl and zs reserved for lens and source redshift, respectively. Note 
that the z-axis is only a coordinate axis used for reference and not the actual path of any light ray. We 
let n denote a unit vector in the photon's direction of motion and x denote its position. To first order in 
0, in the metric ([l|) they obey 



dn 
d^ 



V(/) - n(n • V0) , — = n(l + 20) . (3) 
J or 



In this and similar expressions in this section, cp is to be evaluated on the actual photon path, not on the 
z-axis. 

We now assume that the angle between ft and the z-axis is small (e.g. Seljak 1994| ), and consider the 
components perpendicular to the z-axis of n and x. They obey 

dn\ - , dxi ^ , 

_i = _2Vx<., ^=«x, (4) 

where 'V±(j) denotes the derivative of the potential transverse to the z-axis. In the approximation of 
small angles, these equations are the same as the Newtonian equations of motion for a particle moving 
in a gravitational field, except for the factor of 2 from General Relativity. The absolute mean of (p is 
not observable, since the perturbations in the metric are defined about the large-scale mean. Indeed, 
we may choose our space and time units so that the large-scale value of (p is zero at the Local Group. 
Then is a random variable with r.m.s. value of order 10~^ for the observed LSS power spectrum. 



^ "The lens" refers to some reference point in the lens plane, such as the center if the lens is axi-symmetric. 



- 5 - 



Equation (^) implies that the photon path obeys r(r) = tq — t with 0(0) corrections, where tq is the 
present value of r. The relation of comoving distance to redshift is, e.g. in an Einstein-de Sitter universe, 
r{z) = 2Hq^[1 - (1 + 2)-V2]. Thus comoving distances are simply related to the measured redshifts (to 
0((^)), and so we use comoving distances rather than angular diameter distances {ri and rs refer below 
to the lens and source, respectively). In a homogeneous universe with no LSS, angular diameter distances 
are given hy D = r/(l + z). In general, if an object at comoving distance r has a proper diameter R and 
is observed to subtend an angle 6, then the angular diameter distance is defined to be D = R/9. This 
differs from D in the homogeneous case by terms larger than O (</>). As explained in §1, this is precisely 
the effect which we calculate below, and so we discuss angular diameter distances further in §3. 

We can trace the photon trajectory backwards in time using equations (^), with the final conditions 
x± = and n± = nPi at the observer r = 0. Between the observer and the lens, we find that 



n±{T) 



.'■TO _ 

n° + 2 / V^(/.(r')dT' 



(ro - rX - 2 



(r' - T)V±^{T')dT' . 



(5) 



When the photon is at the lens, its direction of motion is n±{Ti). It is then deflected so that at the 
source side of the lens its direction of motion is p_\_ = n±{TL) + 7. The deflection angle 7 is evaluated at 
Xj_ = x±{tl), and is determined by the mass distribution of the primary lens. Equations (^ then imply 
that, between the lens and the source, 



X±{t) = X_i_{TL) - {tl - t)pj_ - 2 



(r' - T)V±(p{T')dT' 



(6) 



For a given source position Xj_, the lens equation is then x±{ts) = Xj_. 

The total proper time delay, relative to the = path along the z-axis, is given by (e.g. Schneider et 
al.[l99|) 



At 



TO 
TS 



dT - (1 + ZL)i^{xi) . 



(7) 



The first term is the geometrical time delay, the second is the potential time delay due to LSS, and the 
last is the potential time delay due to the lens, given by 



i^{x±) = 4G / d^e'S(C') log 



il + ZL)-^rL 



(8) 



where S(^) is the projected mass density of the lens, and ^ = {1 + zl) ^x± measures proper distance in 
the lens plane. We let r^s = ~ ''^L^ and then the scaled deflection angle is given by 



a = 7 = 

rs rs di 



(9) 
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3. Lensing in the presence of LSS 

Equations (||) and (|6[) cannot in general be solved analytically, since they involve integrals over the 
potential evaluated on the (unknown) photon path. We therefore expand (j) about its value on the 
Surpi et al. T995|) : 



z-axis as m 



4){rz + Xi_)^ct) + x^- Vx0 + y±f(p , (10) 

where the right-hand side (RHS) is evaluated on the z-axis. The second term on the RHS leads to an 
unobservable constant deflection, and the third to a relative deflection between light rays. Unlike Surpi 
et al., we include the third term. To lowest order, in the resulting expansion for W±(f) we substitute for x± 
the expressions given in equations (|5[) and (^ with (j) evaluated on the z-axis. Hereafter, all expressions 
involving (p are evaluated on the z-axis. 

The expansion ( p!o|) should be valid as long as the LSS power on scales smaller than the deflection 
x± is negligible. We find below, however, that the shear is produced by modes over a broad range of 
wavelengths. Moreover, the higher order terms in the expansion formally diverge at small wavelengths 
in an r.m.s. sense, e.g. for a scale-invariant spectrum, at fixed x±. In reality, x± depends on the initial 
direction and on (j). This worry is resolved by using a different expansion, equivalent to summing this 
entire series (see §7). This alternate expansion is convergent, and shows that the contribution of small 
wavelength modes is cut off. For strong lensing we find that the terms in equation ( [loD suffice for an 
accurate analysis. Note that we have not assumed at any point that 6 < 1 for the density. Our expansions 
remain valid even when we include non- linear modes for which 6^1. 

We are not interested in any deflection which is common to all light rays, since such a constant angle 
only affects the unobservable absolute position of the source. We can subtract out such terms to all orders 
simply by measuring displacements relative to some light ray instead of the z-axis. We define this fiducial 
ray as the light ray (null geodesic) passing through the observer and through the lens, and extended out 
to rs (see figure |l]). This ray is defiected by LSS throughout its path, but is not deflected by the primary 
lens. The quantities xj_ and n_L measured relative to the corresponding quantities for this fiducial ray 
we denote by d± and l±, respectively. Then 9 = —1^ is the observed angle of a light ray relative to the 
observed lens position. Note that = Xj_ = riX. 

We define dimensionless 2x2 symmetric tensors, 

Fij(ri,r2) = / {t - T2){ti - T)didj(p{T)dT , 

n - T2 Jt2 



n 

T2 



Gij{n,T2) = -2/ in - T)didj(l){T)dT . (11) 



The traceless part of Fjj is the shear tensor of weak lensing (e.g. Kaiser 1992| ). 
Between the observer and the lens. 



diir) = {To-T)\e' + ejr^{ro,T)] . (12) 
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Fig. 1. — Sketch showing the fiducial ray and an image ray, with distances in comoving coordinates. 



Equations (12) suggest a simple physical interpretation for the two tensors, in our approximation. For 
two rays that end up at the origin at tq with a small angular separation 0, (tq — T)¥ij6^ measures 
the change in their relative separation at r, compared to having no LSS. GijO^ similarly measures the 
induced change in their relative directions at time r. 

If we let pj_ = 1_\_{tl) + 7 (f^) then, between the lens and the source, 



= A + AG){rL,T) - d{{TL) [Gi{TL,T) + G;.(r, tl)\ /{tl - r) , 
d\{r) = di(rL)-(ri-T)[pi+piF^.(ri,r)]+4(ri)Gj(r,rz.). 

The lens equation is d_\_{Ts) = dj_. 

Defining /? = dj_/rs, and denoting e.g. F^^{tq,tl) by Fq^, the lens equation becomes 



(13) 



where a is evaluated at 



(14) 



(15) 



We thus conclude that to our order of approximation, LSS affects the lens equation through three terms, 
which are easily understood. Two rays separated by an angle 9^ at the observer would, in the absence of 
lensing or LSS, be separated by a (comoving) distance r^O'^ at the lens and rg^* at the source. The LSS 
shear changes these distances to rL{9'^ + OjFq^) and rsiO"^ + GjFQg) respectively. The deflection between 
the two rays by an angle —7* at the lens leads to an additional separation of —risl^ — ~^sct* the 
source, or —rs{a^ + OijF^lg) when we include the effect of LSS shear. There are no cross-terms between 
these three effects in our approximation, where only terms linear in 9 and a appear in d±. 



^Repeated indices are summed over the x and y directions. There is no distinction between upper and lower indices. 
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The magnification matrix is 

|| = S^J _ (_Fg^ + + ^IF'J^ + ^Fl^,) , (16) 

where = d^.d^^ip / {4:irGT,cr) is the shear matrix of the primary lens, in units of Y^cr = + 
zl) / {'^T^Gr ir Ls) 1 and in equation (|l6|) is evaluated at d^. With the usual sign conventions in lensing, 
the constant LSS shear is — Fq^^. This term would still be present even in the absence of the lens (see 
§7 below). Note that dp^/dOj is in general not symmetric, which it would be in the absence of LSS. In 
other words, LSS can rotate images. 

As noted in §1, we have also calculated the effect of LSS on angular diameter distances. Indeed, an 

— ♦ — y 

object which subtends an angle 9 at the observer measures a comoving distance dj_ on the lens plane, 
given by equation ([T5|). The same object measures a proper distance R = dj^/(l + zl), which follows 
(to 0{(p)) from the line element (||) taken at constant r. Then = Opj^Oj, where the angular diameter 
"distance" Dqj^ = {5^^ + FQj^)rL/{l + zl) is a tensor when LSS is present. Thus at a given 9 depends 
on orientation, and also R may have a different direction than 9, so when giving "distances" to lenses it is 
preferable to use the comoving distance which is well defined (up to corrections of 0{(j)), i.e. 0.01%) in 
terms of zl- As we show in §5, the components of Fql are of order a few percent, much larger than 0(0) 
corrections. Some other authors (e.g. Ehlers & Schneider 1986, Watanabe et al. 1992, Sasaki 1993) have 
also considered the effect of LSS on angular diameter distances, but they used an oversimplified model in 
which some fraction of the mass density in the universe is distributed in clumps. Theory and observation 
of LSS indicate that a description in terms of a random field with positive and negative fluctuations over 
a range of scales is more realistic (see also the related discussion in peljak 1994 ) . 



4. The Lens Equation and Time Delay 



The lens equation (|T4|) is similar in form to the generalized quadrupole lens of Kovner (1987). Kovner 
cosidered multiple lensing in which there is one primary lens and additional lenses with linear deflection 
laws. In that case Kovner showed how to write the lens equation in the form of a thin-lens equation, 
which simplifies the analysis of properties of the lens mapping, such as image multiplicities and the time 
delay between images. LSS is different in that the deflection is accumulated continuously, but the final 
result can be similarly simplified. Letting 



9^ + 9,Fl^, 



13^ 



the lens equation becomes 



yij _ 



X,F'J^ + a\X) 



^ OS ^ ^ LS ^ OL ' 



(17) 
(18) 

(19) 
(20) 



where we write a as a function of X rather than r^X. We find that 5^^ — F*g plays the same role as the 
"telescope matrix" of Kovner, which in his case is in general symmetric. We find a symmetric Feg only 
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because we are working to first order in the LSS shear. The effective shear F^g is in general significantly 
weaker than F^^, as we show in §5. Still, this shear should be of order 6% r.m.s., compared e.g. with a 
galaxy of ellipticity 20%, which produces a shear of 4%. 

We thus have a simple description of the lens mapping: The source plane is slightly distorted to 
become the Y plane, as given by equation ([l^), so e.g. a circular source appears elliptical in the Y plane. 



Equation (19) then gives the lens mapping from the Y plane to the X plane. Finally, the (observed) 



image plane is also a slightly distorted picture of the X plane, as in equation (17). Only the Y ^ X 
map is non-linear, and it determines the geometry of the lens mapping. Thus e.g. the probability of 
producing quads depends on the sum of Feg and any intrinsic asymmetric shear from the ellipticity of 
the lens galaxy. The shear Feg should tend to make the observed galaxy light distribution inconsistent 
with the observed lensing. If the lens is at high redshift, however, then the distortion in equation ([T7|) 
is also important, since 9 is observed and not X. In this case, the lens itself is distorted by LSS, if it is 
observed. Because this induced ellipticity is likely to be wrongly interpreted as intrinsic to the galaxy, 
it tends to confuse observers as to the actual direction of the galaxy's internal shear, but the effect is 
important only if the intrinsic ellipticity itself is not too large. Since the source plane is not directly 
observable, the distortion in equation (|T8| ) does not affect lens reconstruction, but it is important for 
absolute magnifications (given by equation (p^)), and for measuring shape distortions (§7). 

We can calculate the time delay explicitly using equation (0). However, it is easier to use Fermat's 
principle, which implies that (for a given <j){x,T)) the lens equation must be equivalent to dAt/d9 = 
at fixed /3 (e.g. [Schneider et al. 1992| ). Thus the time delay is the same as that corresponding to the 
thin-lens equation ( p!9| ) which, up to X-independent terms, equals (Kovner 1987) 

At = l^\{X- Yf - Y%X,X,] - (1 + ZLmvLX) . (21) 

We might have expected linear terms of the form O^Ci to make large contributions to At, where Cj is 
independent of 9 and (3, e.g. Cj = r^^s Sts ^«*^(''")'^''" ~ ^^Is Its ^'''^ ~ '^)9j4'iT)dT. Such terms do 

appear in the geometric and potential time delays, but Fermat's principle shows that they must drop out 
in the total time delay. These cancellations can also be demonstrated explicitly with equation (^). 

In addition to shear effects, the lens geometry is also affected by the trace parts of Fgff and Fql- These 
traces cannot be determined through lens reconstruction, since they only affect the unobservable overall 
scales of the lens size and distance. However, they do affect the determination of the Hubble constant 
from lensing. To show the various effects, we first set Fq^ = 0, and consider just equation ( p!9[ ) and the 
effect of the trace part of Feg . We also allow for a focusing term Kd due to a cluster surrounding the lens 
galaxy. Equation 5 of Narayan (1991) applies here: 

At tx a^VL , (22) 

where a represents some characteristic velocity of the lens system. The proportionality constant in this 
equation depends on a number of parameters which can in principle be determined for each pair of images 
from lens modelling. One of these parameters involves a and Kd, in the combination 



1 - ( ^TV Feff + ACcl 



(23) 
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(as follows from equation 6 of Narayan (1991)). Thus if At is measured (and a is not) then from the 
product C^^i we can try to deduce Hq given zi, zs, and an assumed deceleration parameter qq. The 
real Hq is different from the Hq deduced assuming k^i = Tr Feff = by a factor of [1 - (^Tr Feff + Kci)]- 
If both At and a can be measured independently, then as noted in Narayan (1991) we can (if Fq^ = 0) 
circumvent the unknowns in equation (^) and use equation (22) to obtain tl directly, and thus Hq for 
an assumed qq. 

We now add in also the effect of the trace part of Fql in equation (p!7|), which is an unobservable 
magnification of the lens plane produced by foreground structure. Since the time delay is proportional 
to the square of the angular scale (e.g. [Schneider et al. 1992| ), equation ( p2D is replaced by 

At oc (^1 + ^Tr Fol) . (24) 

Reasoning as above, we see that if only At is measured then (from equations ( p3|) and (p^) the real 
global Ho is different from the deduced Hq (for an assumed go) by a factor of [1 + ^Tr(2FoL — Fgfj) — Kd], 
to linear order. LSS thus produces a la uncertainty in determinations of Hq of 

aHo,i = r.m.s. of ^ Tr (2Fol - F,s) ■ (25) 



Contrary to Falco et al. (1985) , we cannot derive an upper bound on Hq from the At measurement since 
while Kci > 0, the LSS term may be negative or positive. Even with measurements of both At and cr^, 
we cannot measure Hq exactly, since when we use equation ( p^ ) we are subject (for a given go) to a la 
uncertainty of 

0-^0,2 = r-m.s. of Tr Fql ■ (26) 

Thus, LSS creates uncertainties in determinations of the Hubble constant from lensing which apply even to 
perfect lens models determined by an arbitrarily large number of observables. If a precise measurement 
of Hq is sought from lens time delays, then at least for some redshifts and LSS power spectra these 
uncertainties may not be very small, as we show in §5. 



5. LSS effects in realistic models 

We have shown above that the effects of LSS on lensing enter through the symmetric tensors Fql, 
Fls and Fqs- For a given lens and source, these tensors will affect the lens mapping as we showed in 
§4, possibly with observable effects which we illustrate in §6. In this section we estimate the typical 
magnitude of these tensors that is expected based on theory and observation of LSS, and its dependence 
on the redshifts of the lens and source. 

The tensor components are random variables of zero mean, with variances and covariances given in 
terms of the power spectrum of (j). For example, if ti > T2 > T3, then following the method of Kaiser 
(1992) we find that 

{Fi,{n,T2) FM{n,Ts)) = 27r2Q,,fc, rk\lk r (t - T2)(ri - r) (t - rs){n - ^) p^^^^^^^ (27) 

Jo Jt2 n -T2 Ti - T3 



1 2 3 4 5 




.5 1 1.5 .5 1 1.5 

Zl Zl 



Fig. 2. — The top plot shows the r.m.s. value of 2 Tr F function of 25, with zl set so that 

rL = ^rg. The bottom right plot shows the same quantity, but as a function of zl, for fixed zs = 3. The 
bottom left plot shows the r.m.s. value of ^ Tr Fol as a function of z^. All curves use the non-linear 
power spectrum, with = 1; h= 0.25, and erg = 0.8. 



3 if ijkl are all equal, 
where Q^j^i = ^ 1 if of ijkl two = x and two = y. 
otherwise. 

This assumes that the dominant contribution comes from modes with wavelengths that are much smaller 
than the distance Ti—T2- This is satisfied for standard forms of the power spectrum and relevant distances. 

We follow the approach of Seljak (1995) for calculating r.m.s. shear. For the linear power spectrum 
we take a scale- invariant spectrum with a CDM type transfer function ( Bardeen et al. 1986| ), which is 



normalized by ag, the mass fluctuation averaged over a sphere of radius 8h ^ Mpc, and whose peak is 
determined by O,moh. Galaxy and cluster surveys are consistent with as ~ 0.8 and ^moh ~ 0.25 (e.g. 



Peacock Sz Dodds (1994) ). We then find the non- linear power spectrum using the mapping proposed by 
Hamilton et al. (1991) and extended by Peacock & Dodds (1994), in the improved form of Jain et al. 
(1995), which they show agrees with N-body simulations at the relevant scales, for an = 1 universe. 
We find that the dominant contribution in equation ( [27| ) comes from wavenumbers A; ~ 3 h Mpc~^, with a 
broad range of two decades on each side contributing significantly. We therefore require a power spectrum 
that is accurate well into the non-linear regime. 

The shear due to Fqs is defined as F = ^(Fi)^ + (Fa)^ where Fi = ^{F^jg - Fgg) and F2 = F^^^. 
Equation (27) shows that F has the same r.m.s. value as ^ Tr Fqs, which is the convergence or surface 
mass density k due to Fqs- The same is true for Feg. Because each of the tensors Fql and F^s is 
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correlated (and so tends to be aligned) with FqSi Fefj tends to have smaller components than Fqs- For 
a given zs, the r.m.s. shear of Feg is maximized at ~ 60% of that of Fos, approximately at zl for which 
= ^rs- In the top plot of figure ^ we show the r.m.s. value of ^ Tr Feg as a function of zs, at this 
maximizing zl- In the bottom right plot we show the same quantity, but as a function of zl, for a fixed 
source at redshift 3. This quantity can be estimated for other redshifts through its scaling oc rLris/ \/rs, 
which is accurate to better than 10%. The bottom left plot shows the r.m.s. value of ^ Tr Fql as a 

3/2 1 

function of z^. This quantity scales approximately as oc rj/ , and also equals ^(Jho,2 as shown in §4. All 
curves use the non-linear power spectrum, which gives r.m.s. shear larger than the linear spectrum by a 
factor of « 2.5. Equation ( p7| ) gives a statistical tendency for perpendicular alignment between Fql and 
Feff, which increases (Jho,i relative to ^ Tr F^q. At = 3 and = \rs, cthq,! = 11-7%, and it scales 
approximately as oc r^y/rs — vl/S. Since the effect of LSS accumulates over distance, we find that the 
induced shears and time delay uncertainties are all smaller at lower redshifts. For the 0957+561 redshifts 
{zl = .36,2:5- = 1.41), the r.m.s. | Tr F^s is 3.7%, cthq,! = 5.9%, and crHo,2 = 3.3%. In addition, note 
that the effects of LSS on lens reconstruction disappear as zl — > 0, even if zs is large. 

The r.m.s. shear increases with cjg, in exact proportion for the linear power spectrum, faster for the 
non-linear spectrum. The r.m.s. shear also grows with h (at fixed a^), by ~ 35% for h = 0.5 compared to 
h = 0.25. As an additional example, tilted CDM (e.g. |Cen et al. 1992 ) with h = 0.5, erg = -6, and power 
spectrum index n = 0.8 lowers the shear by ~ 30% compared to Figure 2. The r.m.s. shear can also be 
calculated for models with 17^0 1 with modified formulas (see Appendix A). 



6. Illustration of the effect of LSS 

Kovner (1987) analyzed in some generality the properties of the lens mapping for an axi-symmetric 
lens perturbed by a weak shear. We simply wish to illustrate the possible observable effects of a shear 
of the magnitude that we obtained in §5. We choose a particular symmetric lens distribution, a singular 
isothermal sphere, with deflection law a{rLO) = 9/9. We use equations (|14|) - (16) to find the caustics 



and critical curves of the lens system. The critical curves are the points in the image plane for which 
the magnification det~^(9/3*/c?0j) is infinite, and the caustics are the corresponding points in the source 
plane. The caustics also determine image multiplicities, in that a source located outside all the caustics 
has a single image, and each time a source moves inside a caustic two images are added (except that for 
a singular surface density, one image is captured in the core when multiple images are produced). For 
a given source position, we can thus find all image positions, magnifications, and also time delays with 
equation (|2l]). 

The components F^^, etc. are random variables, with covariances obtained as in §5 above. We choose 
Zl = 0.78 and zg = 3.0, and take a particular example: 

_ / -3.87 0.50 \ _( -0.70 3.68 \^ ^ _( 6.65 -6.56 \ 

- 1^ 0.50 -2.04 j ' - 3.68 2.20 )^'^^'^^-[ _6.56 -0.31 ) ' 

Figure ^ shows the caustics in the source plane (upper panels) and critical curves in the image plane 
(lower panels), for the lens alone and for the lens plus LSS. For the latter case, the Y (distorted source) 



No LSS 



LSS In 



eluded 




Fig. 3. — Caustics in the source plane (upper panels) and critical curves in the image plane (lower panels) 
for a singular isothermal sphere, with no LSS, and with LSS. For the latter case, the Y (distorted source) 
plane and X (lens) plane are also shown. Also plotted are two source positions (marked + and x) and 
the corresponding images for each. A dot shows the 6 = (3 = position. 

plane and X (lens) plane are also shown. Also plotted are two source positions and the corresponding 
image positions. Table 1 lists the image positions, magnifications, and relative time delays. LSS changes 
the image configurations significantly. It displaces images from the line to the lens, in the two-image 
configuration, and also produces four-image systems when \(5\ is small. 



7. Weak Lensing and Strong Lensing 

The approximation of equation ( [lO|) suffices for consideration of strong lensing, where \9\ is very small 
(~ a few arcseconds). In weak lensing, however, the shear is observed at larger angles (arcminutes), 
and the variation of LSS shear with angle is important. Moreover, as noted above, there are potential 
convergence problems with our expansion of (j), even in the strong lensing case. Our formalism allows 
us to make a more rigorous and powerful expansion, and to study the transition from strong to weak 
lensing. Note that for shape distortions we must use P and 9 rather than Y or X, since we are interested 
in the observed compared to the intrinsic ellipticity of background galaxies. 

We replace equation ([lO| ) with 

OO -| 

4>irz + x^) = J2 -i^± ■ V±)Xrz) , (28) 

OTl. 

where this expression must be consistent with equations ^ and (^) for x±. We now include derivatives to 
all orders in equation (^), but only keep terms linear in cj). This means that in the following expansions 
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Source at /3 = (-0.30,0.30), Y = (-0.31,0.30) 




Image Plane {0) 


Lens Plane {X) 


Magnification 


Relative At 


No LSS 


(-1.00,1.00) 




3.36 


0.84 


(0.39, -0.39) 




-1.36 


With LSS 


(-1.30,1.05) 


(-1.24,1.03) 


2.82 


0.98 


(0.43, -0.46) 


(0.41,-0.45) 


-2.05 


Source at /? = F = (0.05, 0) 


No LSS 


(1.04,0) 




21.0 


0.099 


(-0.94,0) 




-19.0 


With LSS 


(1.10,-0.52) 


(1.06,-0.50) 


6.18 


0.098 
0.121 
0.165 


(-0.82,0.75) 


(-0.78,0.73) 


11.2 


(0.18,0.98) 


(0.17,0.96) 


-9.22 


(-0.68,-0.66) 


(-0.66,-0.65) 


-5.85 



Table 1: Positions of the images shown in figure 3. Also listed are the absolute magnifications (with a 
sign giving the image parity), and the time delay in units of rirs/ris relative to the earliest image to 
arrive at the observer. 



in powers of 9 and 7, we are finding each coefficient up to relative corrections of the same order as the 
r.m.s. LSS shear. 

Between the observer and the lens, equation (^) requires 



x±( 



(r) = -{to - T)nO - 2 / (r' - T)e^^("')-^^ Vx<^(r')dr' 



(29) 



where the exponential denotes the corresponding Taylor series expansion (and (p on the RHS is again 
evaluated on the z-axis). We can find all terms linear in (p in the solution by substituting for x±{t') in 
the RHS the (/>-independent term — (tq — r')nj_. We now find that 

r-To 



+ 2 



,(ro-T')e-Vj 



1 



TO 



r - T 



g{ro-r')e-Vx _ I 



d'(t>{T')dT' 



(30) 



If we let p± = 1±{tl) + then between the lens and the source we similarly find that 



TL 



d\{r) = d\{TL) - (tl - t)pI - 2 I {r'-r) 
The lens equation is 



d'(t>{T')dT' , 

(d^-(Ti-r')px)-V_L _ I 



d'(t){T')dT' 



(31) 



f -a' 



2 r° 



(t - Ts) 



j(To-r)e-Vx _ I 



d'^{T)dT 
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2 



(t - Ts) 



,({ro-T)0-(r£-T)7)-Vx 



5V(T)dT 



(32) 



with d and 7 evaluated at calculated from equation (^0[). The magnification matrix is 





where 



(33) 



A, 



B 



— / e(^«-^)^-^^(r - r5)(ro - r)5ia,</.(r)dr 
rs Jts 

9 , /-TO 



(-o-r)e-Vx(^ - ri)(To - r)afc9,<A(r)dr 



T.5 



If we expand the exponentials to first order in equation ( P2D and zeroth order in equation (|33|), we 
recover the results of §3. If lensing is not strong, Bjj is small, and in the external shear Aij we can set 
7 = (0 — (3)rs/rLs- In the limit of weak lensing, we can set 7 = to get 



- / e(^«^^)^-^MT - r5)(To - T)didjCt>{T)dT . 



(34) 



This expression can be used to calculate two point correlation functions of ellipticity. E.g., we can 
write down (TrA(0 = O)TrA(0)) and evaluate this expectation value in Fourier space. The exponential 
of irO ■ k (in Fourier space) oscillates rapidly at high k, which cuts off small wavelengths and prevents 
any divergence. The result, which is derived fully in Appendix B, agrees with previous analyses of weak 
lensing in the absence of a primary lens (e.g. [Blandford et al. 1991| ; [Miralda-Escude 1991| ; [Kaiser 1992| ). 
These analyses have found that the relative change in the angular correlation of ellipticity is smaller than 
10% (in an r.m.s. sense) for 6 less than about an arcminute. For the non-linear spectrum, we find this 



to be true below about 10" (see also Seljak 1995 ), thus justifying our keeping only linear terms in 6 for 
strong lensing. Our result (^) is more general than weak lensing, as it includes a primary lens {^'^^) and 
cross-terms (B^^). 



We can also get quadratic and higher-order terms in the gradients of 



Given a solution x^f^ we substitute it in the RHS of equation (|2^) and find the next order solution 



by iterating this procedure. 

■(j+i) 



The exponential of ik ■ xj_ ensures that small wavelength modes are cut off in the calculation of x j_ 
This corresponds to the physical intuition that on average (p{rz + x^f^) — (f){rz) is determined by power 
on scales of order |x|[^''|. If we calculate the r.m.s. shear at a point (i.e. for = 7 = 0) corresponding to 
^O'+i)^ for a given x^^(r), the answer is the same as for x~^ {t) = as long as the angular deflection is 
small and LSS power on the scale of rg is negligible. There is, of course a statistical correlation between 
r), but it is typically weak since x^^(r) is determined by the accumulated deflection from 



^^(^{t) and 
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T to To, a distance many times larger than the coherence length of LSS. The first correction to Fqs is a 
relative correction of order 1%, if (j) is Gaussian, and the corrections to Fqs are expected to be small also 
for a non-Gaussian (j) produced through hierarchical clustering. 

8. Conclusions 

We have shown that LSS can have significant effects on strong gravitational lensing. This suggests that 
lens reconstruction done without including LSS might reach incorrect conclusions about the distribution 
of matter in the lensing galaxy or cluster. It also raises the possibility of constraining the amplitude of 
the power spectrum directly, if lensing observations can be used to detect the effect of LSS. 

The effect of LSS is simply described by two symmetric tensors. Including only the effect of Feg, we 
find that the observed power spectrum of LSS requires the presence of an external shear of order 6% 
if zs = 3. This can significantly affect the cross sections for image multiplicities in lens systems. In 
particular, it can produce more images than would be created in the absence of LSS. This implies that 
in addition to the usual magnification bias, which increases the observed number of quads relative to 
doubles, there is a bias in quads toward lines of sight with relatively large effective shear from LSS. 

The second effect, given by Fol, produces a magnification and shear between the observer and the 
lens. This term enters the lens equation differently from the effective shear and should be included in 
lens modelling. It also distorts the lens plane, which contributes an ellipticity to the observed lens galaxy, 
and converts the angular diameter "distance" into a tensor, though the comoving distance is still simply 
defined in terms of the observed redshift. Even if lens reconstruction can model the lens potential exactly, 
we find that LSS creates an absolute uncertainty (ss 5 — 10% at la) in deductions of the Hubble constant 
from time delays. Among lens systems, the uncertainty is smaller for those with lower lens and source 
redshifts. 

Models of quadruple lenses typically find a shear of order 10% in addition to an axi-symmetric mass 
distribution. If this is due to the ellipticity of the lens galaxy, it may imply a larger ellipticity than that 
observed in the galaxy light distribution, as confirmed in a number of cases by recent observations. Since 
quads tend to be produced more easily when the shear due to the galaxy and the effective shear due 
to LSS are aligned, it is important to compare the magnitudes of the observed and modeled shears for 
consistency, and not only their directions. If the shear is due instead to other galaxies or clusters near the 
line of sight to the source, these additional lenses may not be found where expected. Only high-resolution 
observations and careful modelling of particular lens systems will determine if the shear may in part be 
due to LSS. When the parameters of many lenses are confidently known, it may become possible to 
study the redshift dependence of the shear. E.g., LSS does not affect lens reconstruction if the lens is 
at very low redshift. The original Einstein Cross 2237-1-0305 has zl = 0.04, zs = 1.7, and the lens light 
distribution seems to be consistent with the lensing mass (Rix et al. 1992). Other methods must be used 
to investigate independently whether the mass in galaxies is more flattened than the light distribution or 
not. E.g., an affirmative answer is suggested by an optical plus X-ray study of the elliptical galaxy NGC 
720 ( iBuote fc Canizares 199^) . 

Constraining the effects of LSS on strong lensing should complement observations of weak lensing due 
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to LSS. For measurements of weak lensing the sources are background galaxies, and the interpretation is 
compUcated by the unknown source redshift distribution, while for some strong lenses the redshifts of the 
lens and source are known. If the characteristic source redshift for weak lensing is ~ 0.7 — 1 then the shear 
due to LSS is significantly smaller than for strong lensing (e.g. figure ^). In addition, since measurements 
of weak lensing with high signal to noise require relativly large angular fields, the r.m.s. shear is further 
reduced. On the other hand, weak lensing due to LSS can in principle be distinguished from other effects 
by averaging over a wide field, an option not available in strong lensing. Demanding consistency between 
determinations of the effects in these two regimes should allow us to learn more about the distribution 
of matter in the universe. 

I thank Ed Bertschinger for suggesting this problem and for helpful advice, Uros Seljak for valuable 
discussions and for his computer program to calculate r.m.s. shear, Paul Schechter for helpful discussions 
and comments, and the referee Josh Prieman for valuable comments. This work was supported by NASA 
grant NAG5-2816. 



A. Appendix A 



To calculate LSS shear in a curved background requires slight modifications of our formulas (e.g. 
Miralda-Escude 1991; Seljak 1995| ). In a general Robertson- Walker model, the line element is 



aHr) 



[I + 2<j))dT'^ + (1 - 2(j))[dx^ + x{de^ + sin^ Od^"^)] 



(Al) 



in terms of spherical comoving coordinates, where we are now using the variable X = '^o — t- We have 
defined 



K-'^/'^smK^/^X ifK>0. 
sihk X= •{ X if K = 0. 

(_K)-i/2 sinh(-K) if K < 0. 



(A2) 



The curvature is = (ilo ~ l)-f^o- The relation between redshift and r is given by the Friedmann 
equation. 

In a curved geometry, a deflection by angle 69 at x' leads to a perpendicular displacement at x of 
6x± = dOsmxix ~ x')- our approximation of §3, these deflections simply add linearly. Thus, our 
expressions for xj_ or dj_ remain valid if we replace any expression of the form ri — T2 with sinx(ri — T2), 
so e.g. ris = sinxiTL — T5). Thus the lens equation (0), the magnification matrix (|^), and (again by 



Fermat's principle) the time delay (21) all have the same form except that now 

2 



Fij{Tl,T2) 



■ / sini^(r - T2) sini^(ri - T)didj(t){T)dT . 

■JT1 



(A3) 
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B. Appendix B 

In the limit of weak lensing with no primary lens, our formalism reproduces previously derived results. 
Prom equation we find 



(TrA(0 = O)TrA(e)) = \ P dn H dr2 e'^^'-^^nirs - r^rs - rz) (vic/>(Ti)Vi0(T2)) , 

Tg Jo Jo \ ' 

where we have used ri = tq — ti, etc. We convert to Fourier space, and use spherical coordinates 
{k, 9k, We use the approximation that only k values for which krs ^ 1 (i.e., wavelengths much 

smaller than the source distance) make an important contribution. This implies that /q dri f^^ dr2 ~ 
/q"^ dri f^^^ du, with u = r2 — ri, and also that we can set r2 = ri in the distance terms in the integrand. 
Letting oo = ku and denoting ri now by r, in Fourier space our expression becomes 

aJ'J dr J'^'^J duj J d^k e^^resine^cosci,,^iu^cose^ ^-^4^^ ^2 _ j^ip^^j^^^^ To - r) , 



where in the k integration we chose the x-axis in the direction of 0. Under the approximation of krg 3> 1, 

rkrs 
-krs 



/krs 
^^^i^cose, =2^5(cos0fc) . 
-krs 



Our expression thus equals 

2 i-oo /■27r 



Svr dr i^l - —j k^dkP^{k,T = tq - r) d(j)k e'^'^^""^^ , 
or, finally, 

rrs / r \ ^ f°° 

(TrA((9 = 0)TrA((9)) = levr^ J dr r^ j J k^dkP^{k, t = tq - r)Jo{kr0) . (Bl) 

This correlation function of Tr A equals that of twice the shear of A, which has also been derived 



previously through other methods (e.g. Blandford et al. 1991; Miralda-Escude 1991; Kaiser 1992). 
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